On the Importance of the Distance Measures Used to Train and Test Knowledge-Based Potentials for Proteins
Abstract
Knowledge-based potentials are energy functions derived from the analysis of databases of protein structures and sequences. They can be divided into two classes. Potentials from the first class are based on a direct conversion of the distributions of some geometric properties observed in native protein structures into energy values, while potentials from the second class are trained to mimic quantitatively the geometric differences between incorrectly folded models and native structures. In this paper, we focus on the relationship between energy and geometry when training the second class of knowledge-based potentials. We assume that the difference in energy between a decoy structure and the corresponding native structure is linearly related to the distance between the two structures. We trained two distance-based knowledge-based potentials accordingly: one based on all inter-residue distances (PPD), and the other on a set of distances filtered to reflect consistency in an ensemble of decoys (PPE). We tested four types of metrics to characterize the distance between the decoy and the native structure, two based on extrinsic geometry (RMSD and GDT-TS*), and two based on intrinsic geometry (Q* and MT). The corresponding eight potentials were tested on a large collection of decoy sets. We found that it is usually better to train a potential using an intrinsic distance measure. We also found that PPE outperforms PPD, emphasizing the benefits of capturing consistent information in an ensemble. The relevance of these results for the design of knowledge-based potentials is discussed.
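The training assumption stated in the abstract, that the energy gap between a decoy and its native structure is linearly related to their distance, can be illustrated with a small least-squares fit. The sketch below is only an illustration under stated assumptions, not the procedure used in the paper: the function names `fit_linear_potential` and `energy_gap`, the idea of describing each structure by a feature vector, and the synthetic data are all hypothetical. Any proportionality constant between energy and distance is absorbed into the fitted weights.

```python
import numpy as np

def fit_linear_potential(feature_diffs, distances):
    """Fit weights w so that feature_diffs @ w approximates distances.

    feature_diffs: (n_decoys, n_features) array, each row being
        features(decoy) - features(native). The feature definition is an
        assumption of this sketch, not taken from the paper.
    distances: (n_decoys,) array of decoy-to-native distances under the
        chosen measure (for example RMSD or a contact-based score).
    """
    X = np.asarray(feature_diffs, dtype=float)
    d = np.asarray(distances, dtype=float)
    # Ordinary least squares; lstsq returns the minimum-norm solution.
    w, *_ = np.linalg.lstsq(X, d, rcond=None)
    return w

def energy_gap(weights, feature_diff):
    """Predicted energy difference between a decoy and its native structure."""
    return float(np.dot(weights, feature_diff))

# Synthetic, noise-free data (hypothetical): distances are generated from
# known weights, so the fit should recover those weights.
rng = np.random.default_rng(0)
true_w = np.array([0.5, -1.0, 2.0])
X = rng.normal(size=(50, 3))
d = X @ true_w
w = fit_linear_potential(X, d)
```

With real decoy sets, the choice of distance measure changes the target vector `distances`, so the fitted weights and hence the potential differ for each measure; this is the dependence the paper investigates.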
Journal title:
Volume 9, issue
Pages -
Publication date: 2014